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ABSTRACT 

Fast Fourier Transform plays a very important role in many engineering applications. Many different algorithms 
are proposed to improve the architecture of FFT among them Split-radix FFT is a particular FFT algorithm that aims to 
compute FFT with the least number of multiplications. The split-radix FFT mixes radix-2 and radix-4 decompositions, 
yielding an algorithm with about one-third fewer multiplies than the radix-2 FFT. The split-radix FFT has lower 
complexity than the. radix-4 or any higher-radix power-of-two FFT. This paper proposes split-radix FFT for both real and 
floating point numbers. In this paper we use verilog for code implementation. The FPGA synthesis and logic simulation is 
displayed in Xilinx design suitl3.2. The results shows the improvement in the speed 
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INTRODUCTION 

Fast Fourier Transform (FFT) has become ubiquitous in many engineering applications. High-speed FFT 
operations are necessary in the field of digital signal processing, digital image processing applications and also in several 
communication systems. Fast Fourier Transform (FFT) is an efficient algorithm to compute the discrete Fourier transform 
(DFT) and it's inverse. It makes the use of the symmetry properties of twiddle factor to effectively reduce the DFT 
computation time .It is based on fundamental principle of decomposing the computation of DFT of a sequence of length N 
into successively smaller discrete Fourier transforms 

The N-point DFT of sequence x (n) is given by 

X (K) = Y.n=l x(n) 0< K< N-l 

Where W N =e~^ 2n ^ N is known as twiddle factor 

The efficiency of the FFT algorithm lies in its reduced number of arithmetic operations. DFT has the order of 
O (n 2 ) arithmetic operations whereas FFT has the order of O (N log N) arithmetic operations. The Cooley - Tukey 
algorithm was proposed in 1965 which is used to divide the transform into two pieces of size N/2 at each step, and is 
therefore limited to power of two sizes .this algorithm usually called radix-2 algorithm 

Radix-2 Algorithm 

In this radix-2 algorithm N-point sequence decimates into two sequences of length N/2, where one sequence 
consisting of the even-indexed values of x (n) and other odd indexed values of x(n) 

DIT radix-2 representation is 
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X (K) = DFTfeven numbered indices]+ DFT[odd numbered indices] 

DIF radix -2 representation is 

X (2K) = E^o _1 [ x (") + x (n + *)] W$, K=0,1 N/2-1 

X (2K+1) = jW^l x(n) - x (n + 0] W N " W# 2 k=0,l N/2-1 

Radix-4 Algorithm 

When the number of samples N in the input sequence is a power of 4 (, it is more computationally efficient to 
employ a radix-4 FFT algorithm. The radix-4 decimation-in-frequency (DIF) FFT are as follows. we subdivide x (k) into 
four N/4-point sub sequences x(4k), x(4k+l), x(4k+2) and x(4k+3) of each length 0, 1,...., N/4-l. 

So, the radix-4 DIF DFT are obtained as follows 

X (4K) = 2^o -1 [ x(n) + x (n + J) + x (n + ") + x(n + \ N)] W° W# 4 

X (4K+1) = E^t 1 [ x(n) - jx (n + J) - x (n + + jx(n + \ N)] W N n W N nk 

X (4K+2) = E^t 1 [ x(n) - x (n + J) + x (n + g - x(n + \ N)] W N 2n W N nk 

X (4K+3) = S^o" 1 [ x(n) + jx (n + 3 - x (n + 9 - jx(n + j N)] W N 3n W# 4 
Split-Radix FFT 

The split-radix algorithm, first clearly described and named by Duhamel and Hollman in 1984, required fewer 
total multiply and add operations than any previous power-of-two algorithm. Split-radix FFT is one of the FFT algorithms 
that use combination of different radix FFT. Split-radix FFT algorithm combines simplicity of radix-2 FFT with less 
computational complexity radix-4 FFT. 

The advantage of split -radix FFT is that it has considerably fewer number of arithmetic computations compared to 
that of radix-4 and radix-2 FFT. Split-radix also has several other advantages such as regular structure, no reordering of 
internal signals except for outputs, etc. 

The split-radix algorithm improves the arithmetic complexity of the Cooley-Tukey algorithm by further 
decomposing the odd parts into odd odd and odd-even parts, while the even parts are left alone because they have no 
multiplicative factor. Even-numbered samples of the Appoint DFT can be calculated by radix-2 



X (2K) = ^ 2 o _1 [ x(n) + x (n + £)] W# 2 K=0,1 N/2-1 



The odd-numbered samples x (2K+1) requires an additional multiplication of . To implement this, radix-4 
algorithm is used for its lesser computational complexity. Using radix-4 algorithm for the odd -numbered samples of the 
JV-point DFT, the following N/4-point DFTs are 



X (4K+1) = YlLV 1 [ x(n) - jx (n + J) - x (n + ") + jx(n + \ N)] W N " \N\ 



nk 

N/4 
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Where K=0,1 N/4-1 and 

X (4K+3) = ZlLV [ x(n) + jx (n + J) - x (n + 2) - jx(n + \ N)] W N 3n W£% 
Where K=0, N/4-1 

Hence, the Appoint DFT now has been decomposed into one N/2 -point DFT without phase factor and another two 
Af/4-point DFTs with phase factor. Figure 1 shows the split-radix butterfly unit. 




ttf 

Figure 1: Split-Radix Butterfly Unit 



In this paper we implemented the split-radix fft for 32 input. The below figure 2 represents the architecture for 
32input split-radix .In every block represents split-radix unit 

FPGA 

In this paper the proposed 32-input split radix has been simulated and synthesised in using the Xilinx design suite 
13.2. In this proposed paper the device family is vertex-7. 

About vertex-7 

7 series FPGAs are offered exclusively in thermally efficient flip-chip BGA packages. These0.5 mm, 0.8 mm, and 
1.00 mm flip-chip packages range in pin-count from the smallerlO x 10 mm CPG236 to the 45 x 45 mm FF1930. The suite 
of packages is used to address the various power requirements of the 7 series devices. All 7 series devices are implemented 
in the 28 nm process technology. 

Target Device : xc7kl60t-3-fbg676 

Packaging details: FF (G) 676 




Figure 2: 32 Input Split-Radix Implementation Diagram 
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SIMULATION RESULTS 

The below diagram figure 3 represent RTL view of internal split radix block 




Figure 3: RTL View of Internal Split- Radix FFT 

The below figures 5,6,7,8 represents the simulation results of internal split-radix blocks. The inputs given in the 
form both real and floating point numbers. Inputs are from reall to real32 & imaginaryl to imaginary32. The out puts are 
form real_l to real_32& imaginary _1 to imaginary_32 




Figure 4: Simulation Results of First Split-Radix FFT Block 
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Figure 5: Simulation Results of Second Split-Radix FFT Block 
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Figure 6: Simulation Result of Third Split-Radix FFT Block 
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Figure 7: Simulation Result of Fourth Split -Radix FFT Block 

SYNTHESIS REPORT 

The final synthesis report for 32-point SRFFT for both real and floating point inputs is given below the device 
utilization values & time delay for proposed paper 

Table 1: Device Utilization Summary of Vertex-7 for 32-Point SRFFT 
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LUT-FF pairs 








Number of 
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19" 
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31% 
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Table 2: Time Delay of 32-Point SRFFT 
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